Learning to identify video shots with people based on face detection

نویسندگان

  • Rong Jin
  • Alexander G. Hauptmann
چکیده

We examine how to identify video shots with at least two humans using only detected face information. While face detection is much more reliable than shape based people classification in broadcast video, one particular difficulty is that, when there are several humans in an image, the accuracy of face detection is usually significantly degraded, which leads to poor performance in identifying shots of ‘people’. Furthermore, while our standard face detector works from individual still images, we propose using the statistics of face information of images within a whole shot as additional evidence in deciding whether or not a video shot belongs to the ‘people’ category. Empirically, we studied which statistics of face information are more informative than others and how to combine different statistics together in order to achieve better prediction.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Face Detection Method Based on Over-complete Incoherent Dictionary Learning

In this paper, face detection problem is considered using the concepts of compressive sensing technique. This technique includes dictionary learning procedure and sparse coding method to represent the structural content of input images. In the proposed method, dictionaries are learned in such a way that the trained models have the least degree of coherence to each other. The novelty of the prop...

متن کامل

Neural Network Performance Analysis for Real Time Hand Gesture Tracking Based on Hu Moment and Hybrid Features

This paper presents a comparison study between the multilayer perceptron (MLP) and radial basis function (RBF) neural networks with supervised learning and back propagation algorithm to track hand gestures. Both networks have two output classes which are hand and face. Skin is detected by a regional based algorithm in the image, and then networks are applied on video sequences frame by frame in...

متن کامل

Audio-visual synchrony for detection of monologues in video archives

In this paper we present our approach to detect monologues in video shots. A monologue shot is defined as a shot containing a talking person in the video channel with the corresponding speech in the audio channel. Whilst motivated by the TREC 2002 Video Retrieval Track (VT02), the underlying approach of synchrony between audio and video signals are also applicable for voice and face-based biome...

متن کامل

The Effect of Web-based Flipped Classroom Approach on Learning and Satisfaction of Medical Students Comparison with Lecture-based Method

Introduction: Student-centered educational models, such as Flipped classrooms, seem to provide more educational opportunities for learners, especially when combined with web technology. This study aimed to evaluate the effectiveness and satisfaction of medical students with the web-based Flipped classroom method in comparison with the lecture-based teaching method. Method: This is a quasi-exper...

متن کامل

Unsupervised Approach for Retrieving Shots from Video

Acquiring the video information based on user requirement is an important research, that attracts the attention of most of the researchers today. This paper proposes an unsupervised shot transition detection algorithm using Autoassociative Neural Network (AANN) for retrieving video shots. The work further identifies the type of shot transition, whether abrupt or gradual. Keyframes are extracted...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003